pads/t: A Language for Describing and Transforming Ad Hoc Data
نویسندگان
چکیده
pads/t is a declarative data description language paired with an error-aware transformation language. This pairing provides rich support for programming with ad hoc data sources. Such data is common in a wide range of domains including networking, financial analysis, biology, and physics. pads/t’s data description language is based on polymorphic, recursive, and dependent data types. From such descriptions, the compiler generates robust parsing and pretty printing code. The transformation language supports error-aware computation by automatically maintaining an association between data and a description of the errors in that data.
منابع مشابه
Ad Hoc Data and the Token Ambiguity Problem
PADS is a declarative language used to describe the syntax and semantic properties of ad hoc data sources such as financial transactions, server logs and scientific data sets. The PADS compiler reads these descriptions and generates a suite of useful data processing tools such as format translators, parsers, printers and even a query engine, all customized to the ad hoc data format in question....
متن کاملLAUNCHPADS: A System for Processing Ad Hoc Data
An Introduction to PADS. Ideally, any data we ever encounter will be presented to us in standardized formats, such as XML. Why? Because for formats like XML, there are a whole host of software libraries, query engines, visualization tools and even programming languages specially designed to help users process their data. However, we do not live in an ideal world, and in reality, vast amounts of...
متن کاملPADX: Querying Large-scale Ad Hoc Data with XQuery
This paper describes our experience designing and implementing PADX, a system for querying large-scale ad hoc data sources with XQuery. PADX is the synthesis and extension of two existing systems: PADS and Galax. With PADX, an analyst writes a declarative data description of the physical layout of her ad hoc data, and the PADS compiler produces customizable libraries for parsing the data and fo...
متن کاملA Generic Programming Toolkit for PADS/ML: First-Class Upgrades for Third-Party Developers
Domain-specific languages facilitate solving problems in a targeted domain by providing features particular to the domain. Declarative domain-specific languages have the additional benefit that users specify what something means rather than how to do something. As a result, the language compiler is free to choose the best implementation strategies and to generate multiple artifacts from a singl...
متن کاملFully Automatic Tool Generation from Ad Hoc Data
An ad hoc data source is any semistructured data source for which useful data analysis and transformation tools are not readily available. Such data must be queried, transformed and displayed by systems administrators, computational biologists, financial analysts and hosts of others on a regular basis. In this paper, we demonstrate that it is possible to generate a suite of useful data processi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005